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o 



m 



<110> APPLICANT: Gladyshev et al . 

<120> TITLE OF INVENTION: MAMMALIAN SELENOPROTEIN DIFFERENTIALLY EXPRESSED IN TUMOR 
<130> FILE REFERENCE: 4239-56113 

<140> CURRENT APPLICATION NUMBER: US 09/676, 718A 

<141> CURRENT FILING DATE: 2000-09-28 

<150> PRIOR APPLICATION NUMBER: PCT/US99/07560 

<151> PRIOR FILING DATE: 1999-04-06 

<150> PRIOR APPLICATION NUMBER: US 60/080,850 

<151> PRIOR FILING DATE: 1998-04-06 

<160> NUMBER OF SEQ ID NOS : 19 

<170> SOFTWARE: Patentln version 3.1 

<210> SEQ ID NO: 1 

<211> LENGTH: 162 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<220> FEATURE: 

<221> NAME/KEY: SITE 

<222> LOCATION: (93).. (93) 

<223> OTHER INFORMATION: Xaa is selenocysteine 
<400> SEQUENCE: 1 

Met Ala Ala Gly Pro Ser Gly Cys Leu Val Pro Ala Phe Gly Lys Arg 

1 5 10 15 

Leu Leu Leu Ala Thr Val Leu Gin Ala Val Ser Ala Phe Gly Ala Glu 

20 25 30 

Phe Ser Ser Glu Ala Cys Arg Glu Leu Gly Phe Ser Ser Asn Leu Leu 

35 40 45 

Cys Ser Ser Cys Asp Leu Leu Gly Gin Phe Asn Leu Leu Gin Leu Asp 

50 55 60 

Pro Asp Cys Arg Gly Cys Cys Gin Glu Glu Ala Gin Phe Glu Thr Lys 
65 70 75 80 

Lys Leu Tyr Ala Gly Ala lie Leu Glu Val Cys Gly Xaa Lys Leu Gly 

85 90 95 

Arg Phe Pro Gin Val Gin Ala Phe Val Arg Ser Asp Lys Pro Lys Leu 

100 105 110 

Phe Arg Gly Leu Gin lie Lys Tyr Val Arg Gly Ser Asp Pro Val Leu 

115 120 125 

Lys Leu Leu Asp Asp Asn Gly Asn lie Ala Glu Glu Leu Ser lie Leu 

130 135 140 

Lys Trp Asn Thr Asp Ser Val Glu Glu Phe Leu Ser Glu Lys Leu Glu 
145 150 155 160 

Arg lie 

<210> SEQ ID NO: 2 
<211> LENGTH: 1244 
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81 <212> TYPE: DNA 

82 <213> ORGANISM: Homo sapiens 

84 <220> FEATURE: 

85 <221> NAME/KEY: CDS 

86 <222> LOCATION: (5).. (493) 

87 <223> OTHER INFORMATION: 

90 <220> FEATURE: 

91 <221> NAME/KEY: misc_feature 

92 <222> LOCATION: (281).. (283) 

93 <223> OTHER INFORMATION: TGA codon codes for selenocysteine 
96 <400> SEQUENCE: 2 



97 agcg 


ata 


aca 


act 


aaa 

j j i/ 


cca 


aat 




tgt 


ctg 


gtg 


ccg 


gcg 


ttt 


ggg 


eta 


98 




Met 


Ala 


Ala 


Gly 


Pro 


Ser 


Gly 

J; 


Cvs 


Leu 


Val 


Pro 


Ala 


Phe 


Gly 


Leu 


99 




1 








5 










10 










15 


101 


caa 


ttg 


tta 


ttg 


aca 


act 


ata 


Ctt 


caa 


aca 


qtq 


tct 


act 


ttt 


ggg 


gca 


102 


Arg 


Leu 


Leu 


Leu 


Ala 


Thr 


Val 


Leu 


Gin 


Ala 


Val 


Ser 


Ala 


Phe 


Gly 


Ala 


103 










20 










25 










30 




105 


era cr 
y ay 


ttt 


tea 


tea 


aaa 


aca 


tac 


aaa 


aaa 


tta 


aac 


ttt 


tct 


aac 


aac 


ttg 


106 


Glu 


Phe 


Ser 


Ser 


Glu 


Ala 


Cys 


Arcr 

y 


Glu 


Leu 


Glv 


Phe 


Ser 


Ser 


Asn 


Leu 


107 








35 










40 










45 






109 


Ctt 


tgc 


age 


tct 


tgt 


gat 


ctt 


etc 


gga 


cag 


ttc 


aac 


ctg 


ctt 


cag 


ctg 


110 


Leu 


Cys 


Ser 


S.er 


Cys 


Asp 


Leu 


Leu 


Gly 


Gin 


Phe 


Asn 


Leu 


Leu 


Gin 


Leu 


111 






50 










55 










60 








113 


gat 


cct 


gat 


tgc 


aga 


gga 


tgc 


tgt 


cag 


gag 


gaa 


gca 


caa 


ttt 


gaa 


ace 


114 


Asp 


Pro 


Asp 


Cys 


Arg 


Gly 


Cys 


Cys 


Gin 


Glu 


Glu 


Ala 


Gin 


Phe 


Glu 


Thr 


115 




65 










70 










75 










117 


aaa 


aag 


ctg 


tat 


gca 


gga 


get 


att 


ctt 


gaa 


gtt 


tgt 


gga 


tga 


aaa 


ttg 


118 


Lys 


Lys 


Leu 


Tyr 


Ala 


Gly 


Ala 


He 


Leu 


Glu 


Val 


Cys 


Gly 




Lys 


Leu 


119 


80 










85 










90 












121 


gga 


agg 


ttc 


cct 


caa 


gtc 


caa 


get 


ttt 


gtt 


agg 


agt 


gat 


aaa 


ccc 


aaa 


122 


Gly 


Arg 


Phe 


Pro 


Gin 


Val 


Gin 


Ala 


Phe 


Val 


Arg 


Ser 


Asp 


Lys 


Pro 


Lys 


123 


95 










100 










105 










110 


125 


ctg 


ttc 


aga 


gga 


ctg 


caa 


ate 


aag 


tat 


gtc 


cgt 


ggt 


tea 


gac 


cct 


gta 


126 


Leu 


Phe 


Arg 


Gly 


Leu 


Gin 


He 


Lys 


Tyr 


Val 


Arg 


Gly 


Ser 


Asp 


Pro 


Val 


127 










115 










120 










125 




129 


tta 


aag 


ctt 


ttg 


gac 


gac 


aat 


ggg 


aac 


att 


get 


gaa 


gaa 


ctg 


age 


att 


130 


Leu 


Lys 


Leu 


Leu 


Asp 


Asp 


Asn 


Gly 


Asn 


He 


Ala 


Glu 


Glu 


Leu 


Ser 


He 


131 








130 










135 










140 






133 


etc 


aaa 


tgg 


aac 


aca 


gac 


agt 


gta 


gaa 


gaa 


ttc 


ctg 


agt 


gaa 


aag 


ttg 


134 


Leu 


Lys 


Trp 


Asn 


Thr 


Asp 


Ser 


Val 


Glu 


Glu 


Phe 


Leu 


Ser 


Glu 


Lys 


Leu 


135 






145 










150 










155 









137 gaa cgc ata taa atettgetta aattttgtcc tatccttttg ttaccttatc 

138 Glu Arg He 

139 160 

141 aaatgaaata ttacagcacc tagaaaataa tttagttttg cttgcttcca ttgatcagtc 

143 ttttacttga ggcattaaat atctaattaa atcgtgaaat ggcagtatag tccatgatat 

145 ctaaggagtt ggcaagctta acaaaaccca ttttttataa atgtccatcc tectgeattt 

147 gttgatacca ctaacaaaat gctttgtaac agacttgegg ttaattatgc aaatgatagt 

149 ttgtgataat tggtccagtt ttacgaacaa cagatttcta aattagagag gttaacaaga 



49 

97 

145 

193 

241 

289 

337 

385 

433 

481 

533 

593 
653 
713 
773 
833 
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151 


cagatgatta 


ctatgcctca tgtgctgtgt gctctttgaa 


aggaatgaca 


gcagactaca 


893 


153 


aagcaaataa 


gatatactga gcctcaacag attgectget 


cctcagagtc 


tctcctattt 


953 


155 


ttgtattacc 


cagctttctt tttaatacaa atgttattta 


tagtttacaa 


tgaatgeact 


1013 


157 


gcataaaaac 


tttgtagctt cattattgta aaacatattc 


aagatcctac 


agtaagagtg 


1073 


159 


aaacattcac 


aaagatttgc gttaatgaag actacacaga 


aaacctttct 


agggatttgt 


1133 


161 


gtggatcaga 


tacatacttg gcaaattttt gagttttaca 


ttcttacaga 


aaagtccatt 


1193 


163 


taaaagtgat 


catttgtaag accaaaatat aaataaaaag 


tttcaaaaat 


c 




1244 


166 


<210> SEQ ID NO 


: 3 


























167 


<211> LENGTH: 489 


























168 


<212> TYPE: 


DNA 




























169 


<213> ORGANISM: 


Homo sapiens 




















171 


<220> FEATURE: 




























172 


<221> NAME/KEY: 


CDS 


























173 


<222> LOCATION: 


(1) 


..(489) 






















174 


<223> OTHER 


INFORMATION 
























177 


<220> FEATURE: 




























178 


<221> NAME/KEY: 


misc_feature 




















179 


<222> LOCATION: 


(277) . . 


(279) 




















180 


<223> OTHER 


INFORMATION 


: TGA codon codes for selenocysteine 






183 


<4 00> SEQUENCE: 


3 


























184 


atg 


gcg 


get 


ggg 


ccg 


agt 


ggg 


tgt 


ctg 


gtg 


ccg 


gcg 


ttt 


ggg 


eta 


v-y y 


48 


185 


Met 


Ala 


Ala 


Gly 


Pro 


Ser 


Gly 


Cys 


Leu 


Val 


Pro 


Ala 


Phe 


Gly 


Leu 


Arcr 




186 


1 








5 










10 










15 






188 


ttg 


ttg 


ttg 


gcg 


act 


gtg 


ctt 


caa 


gcg 


gtg 


tct 


get 


ttt 


ggg 


gca 


y ay 


96 


189 


Leu 


Leu 


Leu 


Ala 


Thr 


Val 


Leu 


Gin 


Ala 


Val 


Ser 


Ala 


Phe 


Gly 


Ala 


Glu 




190 








20 










25 










30 








192 


ttt 


tea 


teg 


gag 


gca 


tgc 


aga 


gag 


tta 


ggc 


ttt 


tct 


age 


aac 


ttg 


ctt 


144 


193 


Phe 


Ser 


Ser 


Glu 


Ala 


Cys 


Arg 


Glu 


Leu 


Gly 


Phe 


Ser 


Ser 


Asn 


Leu 


Leu 




194 






35 










40 










45 










196 


tgc 


age 


tct 


tgt 


gat 


ctt 


etc 


gga 


cag 


ttc 


aac 


ctg 


ctt 


cag 


ctg 


gat 


192 


197 


Cys 


Ser 


Ser 


Cys 


Asp 


Leu 


Leu 


Gly 


Gin 


Phe 


Asn 


Leu 


Leu 


Gin 


Leu 


Asp 




198 




50 










55 










60 












200 


cct 


gat 


tgc 


aga 


gga 


tgc 


tgt 


cag 


gag 


gaa 


gca 


caa 


ttt 


gaa 


ace 


aaa 


240 


201 


Pro 


Asp 


Cys 


Arg 


Gly 


Cys 


Cys 


Gin 


Glu 


Glu 


Ala 


Gin 


Phe 


Glu 


Thr 


Lys 




202 


65 










70 










75 










80 




204 


aag 


ctg 


tat 


gca 


gga 


get 


att 


ctt 


gaa 


gtt 


tgt 


gga 


tga 


aaa 


ttg 


gga 


288 


205 


Lys 


Leu 


Tyr 


Ala 


Gly 


Ala 


He 


Leu 


Glu 


Val 


Cys 


Gly 




Lys 


Leu 


Gly 




206 










85 










90 












95 




208 


agg 


ttc 


cct 


caa 


gtc 


caa 


get 


ttt 


gtt 


agg 


agt 


gat 


aaa 


ccc 


aaa 


ctg 


336 


209 


Arg 


Phe 


Pro 


Gin 


Val 


Gin 


Ala 


Phe 


Val 


Arg 


Ser 


Asp 


Lys 


Pro 


Lys 


Leu 




210 










100 










105 










110 






212 


ttc 


aga 


gga 


ctg 


caa 


ate 


aag 


tat 


gtc 


cgt 


ggt 


tea 


gac 


cct 


gta 


tta 


384 


213 


Phe 


Arg 


Gly 


Leu 


Gin 


He 


Lys 


Tyr 


Val 


Arg 


Gly 


Ser 


Asp 


Pro 


Val 


Leu 




214 








115 










120 










125 








216 


aag 


ctt 


ttg 


gac 


gac 


aat 


ggg 


aac 


att 


get 


gaa 


gaa 


ctg 


age 


att 


etc 


432 


217 


Lys 


Leu 


Leu 


Asp 


Asp 


Asn 


Gly 


Asn 


He 


Ala 


Glu 


Glu 


Leu 


Ser 


He 


Leu 




218 






130 










135 










140 










220 


aaa 


tgg 


aac 


aca 


gac 


agt 


gta 


gaa 


gaa 


ttc 


ctg 


agt 


gaa 


aag 


ttg 


gaa 


480 


221 


Lys 


Trp 


Asn 


Thr 


Asp 


Ser 


Val 


Glu 


Glu 


Phe 


Leu 


Ser 


Glu 


Lys 


Leu 


Glu 
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222 145 
224 cgc ata taa 



150 



155 



489 



225 Arg lie 

226 160 

229 <210> SEQ ID NO: 4 

230 <211> LENGTH: 136 

231 <212> TYPE: PRT 

232 <213> ORGANISM: Homo sapiens 

234 <220> FEATURE: 

235 <221> NAME/KEY: SITE 

236 <222> LOCATION: (67).. (67) 

237 <223> OTHER INFORMATION: Xaa is selenocysteine 
240 <400> SEQUENCE: 4 



242 


Ser 


Ala 


Phe 


Gly 


Ala 


Glu 


Phe 


Ser 


Ser 


Glu 


Ala Cys 


Arg 


Glu 


Leu 


Gly 


243 


1 








5 










10 








15 




246 


Phe 


Ser 


Ser 


Asn 


Leu 


Leu 


Cys 


Ser 


Ser 


Cys 


Asp Leu 


Leu 


Gly Gin 


Phe 


247 








20 










25 








30 






250 


Asn 


Leu 


Leu 


Gin 


Leu 


Asp 


Pro 


Asp 


Cys 


Arg 


Gly Cys 


Cys 


Gin 


Glu 


Glu 


251 






35 










40 








45 








254 


Ala 


Gin 


Phe 


Glu 


Thr 


Lys 


Lys 


Leu 


Tyr 


Ala 


Gly Ala 


He 


Leu 


Glu 


Val 


255 




50 










55 








60 










258 


Cys 


Gly 


Xaa 


Lys 


Leu 


Gly Arg 


Phe 


Pro 


Gin 


Val Gin 


Ala 


Phe 


Val 


Arg 


259 


65 










70 










75 








80 


262 


Ser 


Asp 


Lys 


Pro 


Lys 


Leu 


Phe 


Arg 


Gly 


Leu 


Gin He 


Lys 


Tyr 


Val 


Arg 


263 










85 










90 








95 




266 


Gly 


Ser 


Asp 


Pro 


Val 


Leu 


Lys 


Leu 


Leu 


Asp Asp Asn Gly Asn 


He 


Ala 


267 








100 










105 








110 






270 


Glu 


Glu 


Leu 


Ser 


He 


Leu 


Lys 


Trp 


Asn 


Thr 


Asp Ser 


Val 


Glu 


Glu 


Phe 


271 






115 










120 








125 








274 


Leu 


Ser 


Glu 


Lys 


Leu 


Glu 


Arg 


He 
















275 




130 










135 



















278 <210> SEQ ID NO: 5 

279 <211> LENGTH: 21 

280 <212> TYPE: DNA 

281 <213> ORGANISM: Artificial Sequence 

283 <220> FEATURE: 

284 <223> OTHER INFORMATION: Primer 

286 <400> SEQUENCE: 5 

287 atggcggctg ggccgagtgg g 21 

290 <210> SEQ ID NO: 6 

291 <211> LENGTH: 21 

292 <212> TYPE: DNA 

293 <213> ORGANISM: Artificial Sequence 

295 <220> FEATURE: 

296 <223> OTHER INFORMATION: Primer 

298 <400> SEQUENCE: 6 

299 taatatgcgt tccaactttt c 21 

302 <210> SEQ ID NO: 7 

303 <211> LENGTH: 21 
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304 <212> TYPE: DNA 

305 <213> ORGANISM: Artificial Sequence 

307 <220> FEATURE: 

308 <223> OTHER INFORMATION: Primer 

310 <400> SEQUENCE: 7 

311 tctgcttttg gggcagagtt t 21 

314 <210> SEQ ID NO: 8 

315 <211> LENGTH: 1216 

316 <212> TYPE: DNA 

317 <213> ORGANISM: Mus musculus 

319 <220> FEATURE: 

320 <221> NAME/KEY: CDS 

321 <222> LOCATION: (11).. (490) 

322 <223> OTHER INFORMATION: 

325 <220> FEATURE: 

326 <221> NAME/KEY: misc_f eature 

327 <222> LOCATION: (287).. (289) 

328 <223> OTHER INFORMATION: TGA codon codes for selenocysteine 

331 <400> SEQUENCE: 8 

332 gaccgcaggg atg gcg gca ggg cag ggt ggg tgg ctg egg cca get ctg 49 

333 Met Ala Ala Gly Gin Gly Gly Trp Leu Arg Pro Ala Leu 

334 1 -5 10 



336 


ggg 


ctg 


cgc 


ttg 


ctg 


ctg 


gcg 


act 


gcg 


ttt 


caa 


gcg 


gtg 


tct 


get 


ctg 


97 


337 


Gly 


Leu 


Arg 


Leu 


Leu 


Leu 


Ala 


Thr 


Ala 


Phe 


Gin 


Ala 


Val 


Ser 


Ala 


Leu 




338 




15 










20 










25 












340 


ggg 


gca 


gag 


ttt 


gcg 


tea 


gag 


gca 


tgc 


aga 


gag 


ttg 


ggt 


ttc 


tec 


age 


145 


341 


Gly Ala 


Glu 


Phe 


Ala 


Ser 


Glu 


Ala 


Cys 


Arg 


Glu 


Leu 


Gly 


Phe 


Ser 


Ser 




342 


30 










35 










40 










45 




344 


aac 


ttg 


etc 


tgc 


age 


tct 


tgc 


gat 


ctt 


ctt 


gga 


cag 


ttt 


aat 


ctg 


etc 


193 


345 


Asn 


Leu 


Leu 


Cys 


Ser 


Ser 


Cys 


Asp 


Leu 


Leu 


Gly 


Gin 


Phe 


Asn 


Leu 


Leu 




346 










50 










55 










60 






348 


cca 


ctg 


gac 


cct 


gtt 


tgc 


aga 


ggg 


tgc 


tgt 


cag 


gaa 


gaa 


gca 


caa 


ttt 


241 


349 


Pro 


Leu 


Asp 


Pro 


Val 


Cys 


Arg 


Gly 


Cys 


Cys 


Gin 


Glu 


Glu 


Ala 


Gin 


Phe 




350 








65 










70 










75 








352 


gaa 


acc 


aaa 


aag 


ctg 


tat 


gca 


gga 


gee 


ate 


ctt 


gaa 


gtc 


tgc 


gga 


tga 


289 


353 


Glu 


Thr 


Lys 


Lys 


Leu 


Tyr 


Ala 


Gly 


Ala 


He 


Leu 


Glu 


Val 


Cys 


Gly 






354 






80 










85 










90 










356 


aaa 


ttg 


ggg 


agg 


ttc 


cct 


caa 


gtc 


caa 


get 


ttt 


gtc 


aga 


agt 


gat 


aaa 


337 


357 


Lys 


Leu 


Gly Arg 


Phe 


Pro 


Gin 


Val 


Gin 


Ala 


Phe 


Val 


Arg 


Ser 


Asp 


Lys 




358 






95 










100 










105 










360 


CCC 


aaa 


etc 


ttc 


aga 


ggt 


eta 


cag 


ate 


aag 


tat 


gtt 


cga 


ggc 


tea 


gac 


385 


361 


Pro 


Lys 


Leu 


Phe 


Arg 


Gly 


Leu 


Gin 


He 


Lys 


Tyr 


Val 


Arg 


Gly 


Ser 


Asp 




362 




110 










115 










120 












364 


cct 


gta 


eta 


aag 


ctt 


ttg 


gac 


gac 


aac 


ggg 


aac 


att 


get 


gaa 


gaa 


eta 


433 


365 


Pro 


Val 


Leu 


Lys 


Leu 


Leu 


Asp 


Asp 


Asn 


Gly 


Asn 


He 


Ala 


Glu 


Glu 


Leu 




366 


125 










130 










135 










140 




368 


age 


ate 


etc 


aaa 


tgg 


aac 


aca 


gac 


agt 


gtg 


gaa 


gag 


ttc 


ctg 


age 


gag 


481 


369 


Ser 


He 


Leu 


Lys 


Trp 


Asn 


Thr 


Asp 


Ser 


Val 


Glu 


Glu 


Phe 


Leu 


Ser 


Glu 




370 










145 










150 










155 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:l; Xaa Pos . 93 
Seq#:4; Xaa Pos. 67 
Seq#:9; Xaa Pos. 93 
Seq#:16; Xaa Pos. 129 
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Output Set: N:\CRF4\08082002\I676718A.raw 

L:55 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 after pos.:80 
L:258 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 after pos.:64 
L:434 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:9^after pos . : 80 
L:575 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 16 after pos.:128 
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